Picture for Shuo Yang

Shuo Yang

Consolidating Rewarded Perturbations for LLM Post-Training

Add code
May 29, 2026
Viaarxiv icon

LATTE: Forecasting Peer Anchored Preference Trajectories for Personalized LLM Generation

Add code
May 26, 2026
Viaarxiv icon

TapSampling: Inference-Time Sampling with a Task-Progress-Understanding Verifier for Robotic Manipulation

Add code
May 25, 2026
Viaarxiv icon

Beyond the Target: From Imitation to Collaboration in Speculative Decoding

Add code
May 24, 2026
Viaarxiv icon

Clipping Bottleneck: Stabilizing RLVR via Stochastic Recovery of Near-Boundary Signals

Add code
May 21, 2026
Viaarxiv icon

One-Way Policy Optimization for Self-Evolving LLMs

Add code
May 21, 2026
Viaarxiv icon

TouchAnything: A Dataset and Framework for Bimanual Tactile Estimation from Egocentric Video

Add code
May 13, 2026
Viaarxiv icon

Active Tabular Augmentation via Policy-Guided Diffusion Inpainting

Add code
May 11, 2026
Viaarxiv icon

LLM Agents Enable User-Governed Personalization Beyond Platform Boundaries

Add code
May 10, 2026
Viaarxiv icon

SuperFace: Preference-Aligned Facial Expression Estimation Beyond Pseudo Supervision

Add code
May 07, 2026
Viaarxiv icon